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Abstract 

Background: The Severe Acute Respiratory Syndrome (SARS) was a newly emerged infectious 
disease which caused a global epidemic in 2002-2003. Sequence analysis of SARS-coronavirus 
isolates revealed that specific genotypes predominated at different periods of the epidemic. This 
information can be used as a footprint for tracing the epidemiology of infections and monitor viral 
evolution. However, direct sequencing analysis of a large number of clinical samples is cumbersome 
and time consuming. We present here a simple and rapid assay for the screening of SARS- 
coronavirus genotypes based on the use of fluorogenic oligonucleotide probes for allelic 
discrimination. 

Methods: Thirty SARS patients were recruited. Allelic discrimination assays were developed 
based on the use of fluorogenic oligonucleotide probes (TaqMan). Genotyping of the SARS- 
coronavirus isolates obtained from these patients were carried out by the allelic discrimination 
assays and confirmed by direct sequencing. 

Results: Genotyping based on the allelic discrimination assays were fully concordant with direct 
sequencing. All of the 30 SARS-coronavirus genotypes studied were characteristic of genotypes 
previously documented to be associated with the latter part of the epidemic. Seven of the isolates 
contained a previously reported major deletion but in patients not epidemiologically related to the 
previously studied cohort. 

Conclusion: We have developed a simple and accurate method for the characterization and 
screening of SARS-coronavirus genotypes. It is a promising tool for the study of epidemiological 
relationships between documented cases during an outbreak. 
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Background 

The Severe Acute Respiratory Syndrome (SARS) is a 
recently emerged infectious disease which led to a global 
epidemic between 2002 and 2003. A novel coronavirus 
(SARS-CoV) was identified as the causative agent [1]. The 
genomic sequence of the SARS-CoV was promptly charac¬ 
terized [2,3]- Thereafter, studies had focused on the early 
detection of SARS-CoV and the development of diagnostic 
tools [4-6]. Systematic analysis of the SARS-CoV sequence 
information have demonstrated that characteristic viral 
genotypes predominated at certain periods during the 
course of the outbreak [7-10]. Furthermore, characteriza¬ 
tion of the viral sequences have been shown to be a useful 
tool for confirming epidemiological associations between 
infected individuals as suspected from conventional epi¬ 
demiological investigations [9-11]. In-depth analysis of 
the available sequence data on SARS-CoV also revealed 
that the viral isolates could be readily subclassified into 
several major genotypes based on nucleotide variations at 
specific genomic positions [8,12]. In a large-scale phylo¬ 
genetic analysis of SARS-CoV sequences [8], a 5-nucle¬ 
otide motif at the GZ02 [GenBank :AY390556] reference 
nucleotide residues 17,564, 21,721, 22,222, 23,823, and 
27,827 was identified to be most useful for distinguishing 
the major SARS-CoV genotypes. These major viral geno¬ 
types predominated at different periods of the epidemic 
[8]. Thus, it is evident that viral sequence and molecular 
epidemiological data provide valuable information and 
tools for our combat against infectious diseases. However, 
direct sequencing of viral isolates from a large number of 
clinical samples is cumbersome and time consuming. 
Therefore, a rapid system for the characterization and 
screening of viral genotypes, such as for SARS-CoV, would 
potentially be useful. In this study, we demonstrate the 
feasibility of the adoption of allelic discrimination assays 
based on the use of fluorogenic oligonucleotide probes 
for the genotyping of SARS-CoV isolates. 

Methods 
Study population 

Viral culture isolates from 30 SARS patients who were 
admitted to the hospitals of the New Territories East Clus¬ 
ter of Hong Kong during the SARS epidemic were 
retrieved. The study was approved by the Institutional 
Review Board. SARS was confirmed in all cases either by 
positive reverse transcription-polymerase chain reaction 
(RT-PCR) detection of SARS-CoV RNA in clinical speci¬ 
mens or documented seroconversion. 

Genotype analysis by Taqman allelic discrimination assay 

We focused on the development of allelic discrimination 
assays for the five previously described characteristic sin¬ 
gle nucleotide variations (SNV) [8]. RNA was extracted 
from viral isolates cultured from SARS patients' clinical 
specimens using the QIAamp viral RNA mini kit (Qiagen, 


Valencia, CA, USA), according to manufacturer's instruc¬ 
tions. Eleven microliters of the extracted viral RNA was 
reverse transcribed by Superscript III (Invitrogen, 
Carlsbad, CA, USA) with random hexamer according to 
manufacturer's instructions. Genotyping of the five SNVs 
was determined using TaqMan (Applied Biosystems, Fos¬ 
ter City, CA, USA) allelic discrimination assays on an ABI 
Prism 7900HT sequence detection system (Applied Bio¬ 
systems). Each assay consisted of two allele-specific minor 
groove binding probes associated with either, 6-carboxy- 
fluorescein (FAM) or VIC™ as the fluorescent label, for the 
discrimination of the two respective alleles at each SNV 
locus. One assay was designed for each of the 5 SNVs. The 
primer and probe sequences, designed using the Primer 
Express 2.0 software (Applied Biosystems) are listed in 
Table 1. The probes were designed such that the discrimi¬ 
natory nucleotide is placed close to the middle portion of 
the oligonucleotide. The assays were set up according to 
the manufacturer's instructions (TaqMan Core PCR Kit; 
Applied Biosystems) in a reaction volume of 25 pL. Each 
reaction consists of IX Buffer A, 4 mM MgCl 2 , 0.2 pM 
dATP, 0.2 pM dCTP, 0.2 pM dGTP and 0.4 pM dUTP, 900 
nM forward and reverse primers, 200 nM of each fluores¬ 
cent probe, 0.25U UNG, 0.625U Taq polymerase and 0.5 
pi of cDNA as template. The thermal profile consists of an 
initial incubation at 50 °C for 2 min, and then a denatur- 
ation period at 95 °C for 10 min, followed by 40 cycles of 
denaturation at 92 °C for 15 s, and 1 min of combined 
annealing and extension at 60 °C. The genotypes were 
scored with the SDS2.1 software. 

Sequence confirmation by direct sequencing 

All viral sequences were confirmed by direct sequencing. 
RT-PCR was performed to specifically amplify genomic 
segments of SARS-CoV encompassing each of the 5 SNVs 
using primers and protocols previously described [7]. The 
DNA of each amplicon was sequenced by the dideoxy ter¬ 
minator method on an automated DNA sequencer (3100 
Genetic Analyzer, Applied Biosystems) based on capillary 
electrophoresis. 

Results and discussion 

Taqman allelic discrimination assays for the 5 SNVs were 
first tested on synthetic templates (Sigma Genosys, Aus¬ 
tralia) (Table 1) and verified using 2 viral isolates, CUHK- 
W1 [GenBank : AY278554 [ and CUHK-SulO [GenBank 
: AY282752 ], CUHK-W1 is a SARS-CoV isolate with a 
G:A:C:T:C motif at the GZ02 reference nucleotide residues 
17,564, 21,721, 22,222, 23,823, and 27,827, characteris¬ 
tic of SARS-CoV strains isolated before worldwide dissem¬ 
ination of SARS [8,13]. On the other hand, CUHK-SulO 
demonstrates a T:G:T:T:T motif which is characteristic of 
SARS-CoV strains isolated after global spread was evident. 
As evident from figure 1, the newly developed allelic 
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Table I: Primers and probes 


Sequence (all sequence starts from the 5' end) 


SNV I 7564* 

Forward primer 
Reverse primer 
Probes 

Synthetic template for allele T 
Synthetic template for allele G 


G AC ACT GT G AGT G CTTT AGTTT AT G AC A 
CCTTT GTAGAACATTTT GAAGCATT G 
FAM-AGCT G ACTT AT CCTT GT GT 
VIC-AGCT GACTTCTCCTT GT GT 

GTT G AC ACT GT GAGTG CTTTAGTTTAT GACAATAAGCTAAAAGCACACAAGGATAAGT CAGCT CA 
AT G CTT C AAAAT GTT CT AC AAAG GT GT 

GTT G AC ACT GT G AGT G CTTT AGTTT AT G AC AAT AAG CT AAAAG C AC AC AAG G AG AAGT C AG CT C 
AAT G CTT C AAAAT GTT CT AC AAAG GT GT 


SNV 21721 
Forward primer 
Reverse primer 
Probes 

Synthetic template for allele G 
Synthetic template for allele A 


CCATTTTATT CTAAT GTTACAGGGTTT CA 
TTT CT CT GT G GC AG C AAAAT AAAT AC 
FAM-ATACGTTT GGCAACCCT GT C 
VIC-ATACGTTT GACAACCCT GT C 

CTT CC ATTTT ATT CT AAT GTT AC AG G GTTT CAT ACT ATT AAT CAT AC GTTT GGCAACCCT GT CAT A 
CCTTTT AAG GAT G GTATTT ATTTT G CTGC C AC AG AG AAATC A 

CTT CC ATTTT ATT CT AAT GTT AC AG G GTTT CAT ACT ATT AAT CAT AC GTTT GACAACCCT GT CAT A 
CCTTTT AAG GAT G GTATTT ATTTT G CT G CC AC AG AG AAAT C A 


SNV 22222 
Forward primer 
Reverse primer 
Probes 

Synthetic template for allele C 
Synthetic template for allele T 


GAGCCATTCTTACAGCCTTTTTA 
G C CAACAAAAT AG G CTG CAG 
F AM -TG CT C AAG AC ACTT G G G - M G B 
VIC -TG CT C AAG AC AITT G G G - M G B 

G CC ATT CTT AC AG CCTTTTT ACCT G CT C AAG AC ACTT G G G G C AC GT C AG CT G C AGCCT ATTTT GT 
TG G CT ATTT AAAG C C AACT AC ATTT ATG CT C AAGT ATG ATG 

GCCATT CTTACAGCCTTTTTACCTGCT CAAGACAITTGGGGCACGT CAGCTGCAGCCTATTTT GT 
TG G CT ATTT AAAG C C AACT AC ATTT ATG CT C AAGT ATG ATG 


SNV 23823 
Forward primer 
Reverse primer 
Probes 

Synthetic template for allele T 
Synthetic template for allele G 


TCGCT CAAGT CAAACAAAT GTACA 
GAG G GT C AG GT AAT ATTT GT G AAAAATT 
F AM -CC AACTTTG AAAIATTTTG G 
VIC-CAACTTT GAAAGATTTTGG 

T GTT CGCT CAAGT CAAACAAAT GTACAAAACCCCAACTTT GAAAIATTTTGGTGGTTTTAATTTTT 
CAC AAAT ATT ACCT G ACCCT CT AA 

T GTT CGCT CAAGT CAAACAAAT GTACAAAACCCCAACTTT GAAAGATTTT GGTG GTTTTAATTTT7 
CAC AAAT ATT ACCT G ACCCT CT AA 


SNV 27827 
Forward primer 
Reverse primer 
Probes 

Synthetic template for allele C 
Synthetic template for allele T 


T C ATT GTTTT G ACTT GT ATTT CT CTATG C 
CTT C AAG CAC AT GAG GTTT ATT AG AT G 
FAM-TT GCATAIGCACT GTAGT 
VIC -TT G CAT AC G C ACT GT AGT 

TTCTCATTGTTTTGACTTGTATTTCTCTATGCAGTTGCATAIGCACTGTAGTACAGCGCTGTGCAT 
CTAATAAACCT CAT GTGCTT GAAGATCC 

TT CT CATT GTTTT GACTT GTATTT CT CTAT GCAGTTGCATACGCACT GTAGTACAGCGCT GT GCA 
T CTAATAAACCT CAT GT GCTT G AAG AT CC 


* probes for SNV 17564 are anti-sense sequences 


discrimination assays were able to differentiate the 2 viral 
isolates and genotype each SNV correctly (Table 1). 

Following initial development and optimization, the 
allelic discrimination assays were used to genotype SARS- 


CoV in clinical samples. We were able to successfully 
determine the SARS-CoV genotypes in all 30 samples. 
Genotypes of virus isolates at the 5 SNV positions are 
shown in Table 2. SARS-CoV from all but seven cases 
showed the T:G:T:T:T motif resembling that of the CUHK- 
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Figure I 

Allelic discrimination plot of CUHK-SulO and CUHK-WI. Allelic discrimination at each of the 5 studied SNVs 
described in the text as demonstrated using the synthetic templates and cDNA from CUHK-Su 10 and CUHK-WI Vero cell 


culture isolates is presented in the successive plots. ( ) synthetic template for the FAM-labeled allele, (0) synthetic tem¬ 


plate for the VIC-labeled allele, (® ) CUHK-WI, (• ) CUHK-Su 10, ( x ) no template control. 


Su 10 isolate. For the remaining seven cases, no allelic sig¬ 
nal was detected at SNV position 27,827 (Figure 2), lead¬ 
ing to a T:G:T:T:/ motif. 

The SARS-CoV genotypes isolated from the 30 patients 
were also confirmed by direct sequencing. The sequencing 
results were fully concordant with that based on the allelic 
discrimination assays at all the 5 SNVs. The seven samples 
which gave no allelic signal by the allelic discrimination 
assay at SNV 27,827 showed a shortened amplicon 
encompassing the region. Direct sequencing of this short 
amplicon revealed a deletion of 386 nt identical to a 
SARS-CoV deletion variant previously reported by our 


group [8,9]. This deletion variant was first isolated from a 
discrete cohort of 15 epidemiologically related SARS 
patients [9]. In the previous cohort of patients, the origin 
of the deletion variant was traceable to mid-April 2003 in 
two patients residing in an estate, T, in Hong Kong with 
subsequent spread predominantly at the North District 
Hospital, Hong Kong [9]. To further determine if the 
newly identified cases were epidemiologically related to 
the original patient cohort, the case histories were 
reviewed. 

The seven patients had fever onset between April 4 to 15, 
2003 which predated the disease onset dates of all cases in 
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SNV 27827 
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Figure 2 

Allelic discrimination plot of 30 SARS patients using 

the allelic discrimination assay on SNV 27827. (0 ) 

synthetic template for FAM-labeled allele, (0) synthetic 
template for VIC-labeled allele, (* ) patients with SARS-CoV 

showing a T nucleotide at SARS-CoV SNV 27827, ( ) 
patients with SARS-CoV showing no allelic signal at SNV 
27827. This SARS-CoV variant was confirmed to contain a 

386 nt deletion encompassing SNV 27827, ( x ) no template 
control. 


the original patient cohort. All but one patient presented 
to hospital A initially for other medical conditions with¬ 
out fever or evidence of chest infection and appeared to 
have acquired SARS nosocomially during admission. The 
remaining patient was a resident at Estate T and presented 
to hospital A with symptoms and signs of chest infection 
on April 8, 2003 and SARS was subsequently confirmed. 
Thus, it appeared that we have identified another cohort 
of patients harboring the SARS-CoV variant with the 386- 


nt deletion. It is interesting to note that this study pro¬ 
vided additional anecdotal evidence pointing to Estate T 
as a propagation site for the deletion variant. In addition, 
we were able to trace the emergence of this deletion vari¬ 
ant to early April 2003, weeks before the first appearance 
reported previously [9]. 

Our study has clearly demonstrated the feasibility of using 
allelic discrimination assays as a method for genetic char¬ 
acterization of SARS-CoV genotypes in patients. It is 
particularly useful when there is already extensive 
sequence information. Direct sequencing is still the gold 
standard for identifying new sequence variations when 
new agents of infectious disease continue to emerge and 
old ones reemerge. Once the variations have been identi¬ 
fied, allelic discrimination assay is more efficient and suit¬ 
able for large-scale population investigations. A recent 
study illustrated the use of mass spectrometry-based tech¬ 
nology in characterizing SARS sequence variations [14]. 
However, this method requires post-PCR manipulations 
and the availability of specialized equipment. On the 
other hand, allelic discrimination assays have been widely 
used in the study of associations between single nucle¬ 
otide polymorphisms and diseases such as cancers [15] 
and rheumatoid arthritis [16]. The validity of the 
approach for single nucleotide polymorphism genotyping 
has been previously demonstrated [17-19]. Thus, this 
study further extended the usefulness of allelic discrimina¬ 
tion approach based on fluorogenic oligonucleotide 
probes. The approach provides a rapid and simple means 
to accurate genotype screening, making it ideal for epide¬ 
miological investigations. 

Conclusion 

We have evaluated a rapid approach for characterizing 
SARS-CoV genotypes. The assay is simple, easy to perform 
and reproducible. It can therefore be used as an efficient 
means to screen for virus genotypes and track the trans¬ 
mission of a particular viral strain in times of epidemics. 
Incidentally, we identified a previously reported deletion 
variant of the SARS-CoV in a new cohort of patients and 
traced the emergence of this variant to an earlier date than 
previously reported. 

List of abbreviations 

SARS: Severe acute respiratory syndrome 

SARS-CoV: SARS-coronavirus 

RT-PCR: reverse transcription-polymerase chain reaction 
SNV: single nucleotide variation 
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Table 2: Genotype of SARS-CoV culture isolates from 30 patients determined by Taqman Allelic Discrimination assays 


Case number 



Nucleotide positions 




17,564 

21,721 

22,222 

23,823 

27,827 

TC0I 

T 

G 

T 

T 

T 

TC02 

T 

G 

T 

T 

T 

TC07 

T 

G 

T 

T 

T 

TC10 

T 

G 

T 

T 

T 

TCI 1 

T 

G 

T 

T 

T 

TC 14 

T 

G 

T 

T 

T 

TC 19 

T 

G 

T 

T 

T 

TC20 

T 

G 

T 

T 

T 

TC22 

T 

G 

T 

T 

T 

TC26 

T 

G 

T 

T 

T 

TC27 

T 

G 

T 

T 

T 

TC28 

T 

G 

T 

T 

T 

TC29 

T 

G 

T 

T 

ND 

TC30 

T 

G 

T 

T 

T 

TC32 

T 

G 

T 

T 

ND 

TC34 

T 

G 

T 

T 

T 

TC37 

T 

G 

T 

T 

T 

TC38 

T 

G 

T 

T 

T 

TC39 

T 

G 

T 

T 

T 

TC4I 

T 

G 

T 

T 

T 

TC42 

T 

G 

T 

T 

T 

TC43 

T 

G 

T 

T 

T 

TC44 

T 

G 

T 

T 

T 

TC45 

T 

G 

T 

T 

ND 

TC46 

T 

G 

T 

T 

ND 

TC48 

T 

G 

T 

T 

T 

TC5I 

T 

G 

T 

T 

ND 

TC52 

T 

G 

T 

T 

T 

TC55 

T 

G 

T 

T 

ND 

TC59 

T 

G 

T 

T 

ND 

CUHK-WI 

[AY278554] 

G 

A 

C 

T 

C 

CUHK-Su 10 
[AY282752] 

T 

G 

T 

T 

T 


ND: no allelic signal 
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